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SCALABLE VOICE OVER IP SYSTEM CONFIGURED FOR DYNAMICALLY 
SWITCHING CODECS DURING A CALL 



BACKGROUND OF THE INVENTION 



FIELD OF THE INVENTION 

The present invention relates to systems configured for providing advanced telephony-type 
services for subscribers in a voice over Internet Protocol (IP) nelAvork according to H,323 protocol. 

DESCRIPTION OF THE RELATED ART 

The evolution of the public sw^itched telephone network has resulted in a variety of voice 
applications and services that can be provided to individual subscribers and business subscribers. 
Such services include voice messaging systems that enable landline or wireless subscribers to record, 
playback, and forward voice mail messages. However, the ability to provide enhanced services to 
subscribers of the public switched telephone network is directly affected by the limitations of the 
public switched telephone network. In particular, the public switched telephone network operates 
according to a protocol that is specifically designed for the transport of voice signals; hence any 
modifications necessary to provide enhanced services can only be done by switch vendors that have 
sufficient know-how of the existing public switched telephone network infrastructure. Hence, the 
reliance on proprietary protocols and closed development environments by telecommunications 
equipment providers has limited service providers to vendor-specific implementations of voice and 
telephony services. 

Voice over IP technology is under development as part of an alternative open packet 
telephony communications network, distinct from the public (circuit switched) telephone network, 
capable of using packet switched networks for integrating voice, data, facsimile, and Internet 
services, and the like. New packet telephony voice services are being built from open standards such 
as The International Telecommunications Union (ITU) Recommendation H.323. Recommendation 
H.323 defines the components, procedures, and protocols necessary to provide audiovisual 
communications on local area networks. Recommendation H.323 is based on the Real Time 
Protocol/Control Protocol (RTP/RTCP) of the Intemet Engineering Task Force (IETF), and applies 
to either point-to-point or multipoint sessions, and references many other ITU recommendations, 
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including H. 225 and H.245. Recommendation H.22 5 specifies messages for call control including 
signaling, registration and admissions, and packetization/synchronization of media streams. 
Recommendation H.245 specifies messages for opening and closing channels for media streams, and 
other commands, requests and indications. 

However, the existing H.323 standard does not specify features that provide flexibility for 
deployment of an IP-based system that supports advanced application services such as unified 
messaging. In particular, unified messaging involves presenting subscribers with messages in a 
prescribed format, even if the messages are stored in a different, incompatible format. For example, 
a messaging subscriber accessing a unified messaging system via telephone may rely on a text to 
speech converter to convert text-based messages such as e-mail messages into speech. Existing text 
to speech software processes currently have native support only for G.71 1 codecs: G.711 is an 
intemational standard for encoding telephone audio on a 64 kbps chaimel. However, G.729 (or 
G.729A) and G.723 are alternative standards for encoding toll quality telephone audio on an 8 kbps 
channel and 6.4 kbps channel, respectively. Hence, if a service provider wishes to deploy a voice 
over IP network that takes advantage of compression schemes such as G.729A or G.723 to conserve 
resources such as disk space for voice messages, the unified messaging system needs to support 
multiple codecs in a scalable manner. 

SUMMARY OF THE INVENTION 

There is a need for enhanced IP -based coinmunication services that enable a services 
subscriber to utilize a user interface application, such as a unified messaging application or a voice 
activated dialing application, that dynamically switches between codecs in order to support 
applications configured for operating using the different codecs. 

There is also a need for an arrangement that enables an IP telephony gateway to close an 
existing media stream configured for operating at a first compression rate on an existing connection 
and open on the existing connection a new media stream operating at a second compression rate. 

These and other needs are attained by the present invention, where an IP telephony gateway 
and a media server are configured for changing media streams during an existing call according to 
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voice over IP (H.323) protocol, v^here a first media channel configured for transmitting a first media 
stream at a corresponding first compression is closed and a second media channel configured for 
transmitting a second media stream at a corresponding second compression is opened during the 
same call. 

5 One aspect of the present invention provides a method in a media server configured for 

providing telecommunications user interface services to a subscriber. The method includes 
establishing a call having a first media channel with an IP telephony gateway, the first media channel 
configured for transmitting a first media stream according to a corresponding first compression. The 
method also includes initiating closing of the first media channel based on a request for a resource 
10 utilizing a second compression, and starting for the call a second media channel, configured for 
transmitting a second media stream according to the second compression, upon closing the first 
O media channel. The initiation of closing the first media channel, and starting the second media 
channel, enables the media server to dynamically switch during a call between codecs having 
respective compression characteristics. Hence, the media server can provide advanced call services 
rt5 for the subscriber, enabling a subscriber to access during the same call different resources having 
I: [ respective codec configurations, such as a voicemail system utilizing low-bandwidth codecs such 

s_ as G.729A or G.723 at 8kbps, followed by access to a text-to-speech resource utiHzing a G.71 1 

O 

codec operating at 64kbps. 

fl Another aspect of the present invention provides a media server comprising a first interface 

□0 and a second interface. The first interface is configured for establishing with an IP telephony 
~" gateway a call having a first media channel configured for transmitting a media stream according 
to a first compression, the first interface configured for initiating closing of the first media channel 
and starting for the call a second media channel, configured for transmitting a second media stream 
according to a second compression, based on a request for a resource utilizing the second 
25 compression. The second interface is configured for receiving fi-om the resource a media stream 
having the second compression. Hence, the media server can supply media streams having different 
compression formats during the same call by closing one media channel and opening another media 
channel configured for a different compression. 
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Additional advantages and novel features of the invention will be set forth in part in the 
description which follows and in part will become apparent to those skilled in the art upon 
examination of the following or may be learned by practice of the invention. The advantages of the 
present invention may be realized and attained by means of instrumentalities and combinations 
particularly pointed out in the appended claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Reference is made to the attached drawings, wherein elements having the same reference 
numeral designations represent Uke elements throughout and wherein: 

Figure 1 is a block diagram illustrating a voice over IP system having a voice over IP 
gateway and a media server configured for dynamically switching codecs during a voice over IP call 
according to an embodiment of the present invention. 

Figure 2 is a flow diagram illustrating a voice over IP call that necessitates switching codecs 
during the call. 

Figure 3 is a diagram illustrating the method between the voice over IP gateway and the 
media server for dynamically switching codecs by closing and reopening media streams during a call 
according to an embodiment of the present invention. 

BEST MODE FOR CARRYING OUT THE INVENTION 
Figure 1 is a diagram illustrating a voice over IP user interface system configured for 
dynamically changing codecs during a voice over IP call according to an embodiment of the present 
invention. The system 10 includes an IP telephony gateway 12, implemented for example using a 
Cisco AS5300 Universal Access Server, configured for establishing Real Time Protocol (RTP) data 
stream connections according to H.323 protocol for a calling party 14 dialing into an IP-based 
resource via the public switched telephone network 16. The system 1 0 also includes a media server 
1 8, for example an IP-based unified messaging system configured for establishing unified messaging 
sessions with the calling party 14. An exemplary implementation of the unified messaging system 
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is the commercially available Cisco Unified Open Network Exchange (Cisco uOne) (Products 
UONE-FOUND 4. IS, and UONE- V0ICEMSG4. 1 S) from Cisco Systems, Inc. 

The media server 1 8 is configured for establishing H.323 calls with the IP telephony gateway 
12 on a first interface 40 by exchanging call control and signaling commands across an H.225 call 
control channel 20. In particular, the media server 18 is implemented as an H.323 compliant 
software resource configured for executing selected user interface applications, for example a unified 
messaging system. The media server 1 8 also is configured for sending and receiving channel control 
messages on an H.245 media control channel 22 setup and tear down of Real Time Protocol (RTP) 
media channels 24 configured for transmitting media streams according to a prescribed compression. 

Figure 2 illustrates a scenario where the media server 18 may need to dynamically switch 
codecs during a voice over IP call. Typically the media server 18 is accessed in step 30 for storage 
and retrieval of voice messages or messaging prompts from a message store 26 in step 32, for 
example an IMAP type directory that stores voice messages according to G.723 or G.729 
compression in order to conserve disk resources. In this case, the media server 18, having 
established an RTP media chaimel 24b for transmitting a media stream according to G.729 
compression, transfers the media stream via the media channel 24b to the IP telephony gateway 12; 
the IP telephony gateway, having activated its internal G.729 codec during the initial establishment 
of the RTP media channel 24b, generates analog speech signals using its internal G.729 codec based 
on the G.729 speech samples transferred in the media stream via the media channel 24b. 

In some cases, however, the media server 1 8 may need to transfer media data received on a 
second interface 42 from a text to speech resource 28 that generates speech according to a different 
compression, such as G.711. For example, the subscriber may request in step 34 that the media 
server 18 plays a stored e-mail message; in such a case, the media server 18 sends the text to a text 
to speech converter 28, configured for generating audio data using a higher data rate codec, for 
example a G.71 1 codec configured for generating audio samples at 64 kbps. Hence, the text to 
speech converter 28 outputs in step 38 G.71 1 -encoded media data to the media server 18, which is 
incompatible with the intemal G.729 codec activated by the IP telephony gateway 12 for the media 
channel 24b. One altemative would be that the media server 18 transcodes the media stream from 
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G. 7 1 1 compression to G. 729 compression; however, such an approach requires substantial processor 
resources within the media server 18 and hence does not scale well for large scale deployment. 

According to the disclosed embodiment, the media server is configured for dynamically 
switching codecs during the voice over IP call established by the H.225 call control channel 20 by 
closing the media channel 24b configured for transmitting G.729 media streams, and starting for the 
call a second media channel 24b' configured for transmitting a media stream according to the G.7 11 
compression utilized by the text to speech converter 28. In particular, the media server 1 8 sends a 
close logical channel message for its outbound RTP channel 24b via the H.245 media control 
channel 28. After the IP telephony gateway 12 acknowledges to the close logical channel message, 
the media server 18 sends an open logical channel message on the H.245 media control channel 28, 
specifying that a G.71 1 codec be enabled by the IP telephony gateway 12. Once the IP telephony 
gateway 12 acknowledges the open logical channel message, the media server 18 is ready to play 
the output from the text to speech server 28 according to the G.711 compression. Note that the 
inbound media channel 24a (i.e., from the IP telephony gateway 12 to the media server 18) may 
continue to use G.729A compression if the IP telephony gateway 12 supports asymmetric codecs. 
Once the text to speech play is completed, or if the text to speech play is interrupted by the 
subscriber, the media server 1 8 can reverse the process and switch back to the G.729 A compression 
prior to playing prompts. 

Figure 3 is a diagram illustrating the method of dynamically switching codecs by the media 
server 18 according to an embodiment of the present invention. The steps may be implemented as 
executable code stored on a tangible medium, for example a disk drive, a compact disc, or a 
transmission line medium. The method begins by the IP telephony gateway 12 sending a setup 
message in step 50 to the media server 18 on the H.225 call control channel 20 in response to an 
incoming call from the subscriber 14. The media server 18 in response sends on the H.225 call 
control channel 20 a connect message back to the IP telephony gateway 12 in step 52. The IP 
telephony gateway 12 and the media server 18 then exchange capabilities information in step 54 to 
determine which codecs are available for use. 
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The IP telephony gateway 1 2 and the media server 1 8 each then send an open logical channel 
message to establish a corresponding RTP media channel 24 for full duplex communications. In 
particular, the IP telephony gateway 12 sends a logical channel open message in step 56a on the 
H.245 media control channel 22, and the media server 18 responds by sending an acknowledgment 
5 in step 58a, causing the IP telephony gateway 12 to open the media channel 24a in step 60a using 
for example the G.729 compression. The media server 1 8 also sends a logical channel open message 
in step 56b on the H.245 media control channel 22, and the IP telephony gateway 12 responds by 
sending an acknowledgment in step 58b, causing the media server 1 8 to open the media channel 24b 
in step 60b using the G.729 compression. At this point, the IP telephony gateway 12 and the media 
1 0 server 1 8 have established a full duplex, two-way RTP channel (24a and 24b) using preferred voice 

mail compression, such as G.729. 
Q Assume now that the messaging subscriber 1 4 requests use of the text to speech resource 28, 

^1 configured for utilizing a second compression such as G.7 1 1 . The media server 1 8 determines that 
^2 a codec switch is necessary, and initiates the text to speech conversion process by sending the text 
ffl5 to the text to speech resource. The media server 1 8 also sends via the H.245 media control channel 
^{ 22 in step 62 a close logical channel message for the outbound RTP channel 24b. The IP telephony 
^ gateway 1 2 acknowledges the close logical channel message in step 64, causing the media server 1 8 
'%j to send another H.245 open logical channel message in step 66 that specifies that a G.71 1 codec be 
enabled in the IP telephony gateway 12. The IP telephony gateway 12 enables its internal G.71 1 
□0 codec, and acknowledges the open logical channel message via the H.245 media control channel 22 
^ in step 68. Once the media server 1 8 receives the acknowledgment, the media server 1 8 starts in step 
70 the new RTP media channel 24b' configured for transmitting the G.71 1 media stream fi-om the 
text to speech resource 28 to the IP telephony gateway 12. 

According to the disclosed embodiment, a voice over IP system is configured for supporting 
25 multiple codecs and switching between codecs during a call. Hence, a media server can utilize 
different third party resources that have native support only for certain codecs such as G.711, 
increasing the flexibility of IP based unified messaging systems that need to support applications that 
require different compression formats. Moreover, the ability to switch between codecs during a call 
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eliminates the necessity for transcoding from G.711 to G.729 in real-time; hence, processor 
resources are conserved, increasing the scalability of the messaging system. Moreover, the 
switching between codecs during a call has a lower latency than transcoding, providing a better user 
experience while the user is listening to text-based messages such as e-mails. 
5 While this invention has been described in connection with what is presently considered to 

be the most practical and preferred embodiment, it is to be understood that the invention is not 
Hmited to the disclosed embodiments, but, on the contrary, is intended to cover various 
modifications and equivalent arrangements included within the spirit and scope of the appended 
claims. 
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